Reanalysis of RNA-Sequencing Data Reveals Several Additional Fusion Genes with Multiple Isoforms
Abstract
RNA-sequencing and tailored bioinformatic methodologies have paved the way for identification of expressed fusion genes from the chaotic genomes of solid tumors. We recently exploited RNA-sequencing to discover 24 novel fusion genes in breast cancer. Here, we demonstrate the importance of continuous optimization of the bioinformatic methodology for this purpose, and report the discovery and experimental validation of 13 additional fusion genes from the same samples. Integration of copy number profiling with the RNA-sequencing results revealed that the majority of the gene fusions were promoter-donating events that occurred at copy number transition points or involved high-level DNA amplifications. Sequencing of genomic fusion breakpoints confirmed that DNA-level rearrangements underlie selected fusion transcripts. Furthermore, a significant portion (>60%) of the fusion genes were alternatively spliced. This illustrates the importance of reanalyzing sequencing data as gene definitions change and bioinformatic methods improve, and highlights the previously unforeseen isoform diversity among fusion transcripts.
Related articles
Transcriptome analysis and estimation of isoform expression for three genes of the PI3K and FGFR signaling pathways in bladder cancer
Background: Aberrant pre-mRNA alternative splicing is a common event in cancer cells. Many abnormally spliced RNA variants have been observed in tumor cells and they can be used as biomarkers or therapeutic targets in new drug design. Increasing our knowledge in understanding the mechanisms of alternative pre-mRNA splicing for cancer-related genes and determination of cancer specific isoforms a...
InFusion: Advancing Discovery of Fusion Genes and Chimeric Transcripts from Deep RNA-Sequencing Data
Analysis of fusion transcripts has become increasingly important due to their link with cancer development. Since high-throughput sequencing approaches survey fusion events exhaustively, several computational methods for the detection of gene fusions from RNA-seq data have been developed. This kind of analysis, however, is complicated by native trans-splicing events, the splicing-induced comple...
Characterization of fusion genes and the significantly expressed fusion isoforms in breast cancer by hybrid sequencing
We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast canc...
High throughput RNA sequencing utility for diagnosis and prognosis in colon diseases
RNA sequencing is the use of high throughput next generation sequencing technology to survey, characterize, and quantify the transcriptome of a genome. RNA sequencing has been used to analyze the pathogenesis of several malignancies such as melanoma, lung cancer, and colorectal cancer. RNA sequencing can identify differential expression of genes (DEGs), mutated genes, fusion genes, and gene isofo...
Optimized approach for Ion Proton RNA sequencing reveals details of RNA splicing and editing features of the transcriptome
RNA-sequencing (RNA-seq) has become the standard method for unbiased analysis of gene expression but also provides access to more complex transcriptome features, including alternative RNA splicing, RNA editing, and even detection of fusion transcripts formed through chromosomal translocations. However, differences in library methods can adversely affect the ability to recover these different ty...